-
Notifications
You must be signed in to change notification settings - Fork 0
Conversation
Default von max bei motif_filtering bei mir ist 200, nicht 100. Hatte sich geändert. |
In the last code section, when you mention the rare moods bug, maybe add, that the pipeline (in step 1.2) checks if those files are empty and returns an error message if so. Or is this clear? |
README.md
Outdated
this can be done with following commands: | ||
``` | ||
export PATH=[meme-suite instalation path]/libexec/meme-[meme-suite version]:$PATH | ||
export PATH=[meme-suite instalation path]/bin:$PATH | ||
``` | ||
|
||
Every other dependency will be automatically installed by Nextflow using conda. For that a new conda enviroment will be created, which can be found in the from Nextflow created work directory after the first pipeline run. | ||
It is **not** required to create and activate the enviroment from the yaml-file beforehand. | ||
3. Every other dependency will be automatically installed using conda. For that a conda enviroment has to be created from the yaml-file given in this repository. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Two whitespaces after 'automatically'.
README.md
Outdated
Filter unknown motifs: | ||
--min_size_fp INT Minimum sequence length threshold. Smaller sequences are discarded. (Default: 10) | ||
--max_size_fp INT Maximum sequence length threshold. Discards all sequences longer than this value. (Default: 100) | ||
|
||
--tfbsscan_method [moods|fimo] Method used by tfbsscan. (Default: moods) | ||
Clustering: | ||
Sequence preparation/ reduction: | ||
--kmer INT Kmer length (Default: 10) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
*K-mer length
README.md
Outdated
Clustering: | ||
--global INT Global (=1) or local (=0) alignment. (Default: 0) | ||
--identity FLOAT Identity threshold. (Default: 0.8) | ||
--sequence_coverage INT Minimum aligned nucleotides on both sequences. (Default: 8) | ||
--memory INT Memory limit in MB. 0 for unlimited. (Default: 800) | ||
--throw_away_seq INT Remove all sequences equal or below this length before clustering. (Default: 9) | ||
--strand INT Align +/+ & +/- (= 1). Or align only +/+ (= 0). (Default: 0) | ||
|
||
Motif estimation: | ||
--min_seq INT Sets the minimum number of sequences required for the FASTA-files given to GLAM2. (Default: 100) | ||
--motif_min_key INT Minimum number of key positions (aligned columns) in the alignment done by GLAM2. (Default: 8) | ||
--motif_max_key INT Maximum number of key positions (aligned columns) in the alignment done by GLAM2.f (Default: 20) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
GLAM2.f?
README.md
Outdated
Clustering: | ||
--global INT Global (=1) or local (=0) alignment. (Default: 0) | ||
--identity FLOAT Identity threshold. (Default: 0.8) | ||
--sequence_coverage INT Minimum aligned nucleotides on both sequences. (Default: 8) | ||
--memory INT Memory limit in MB. 0 for unlimited. (Default: 800) | ||
--throw_away_seq INT Remove all sequences equal or below this length before clustering. (Default: 9) | ||
--strand INT Align +/+ & +/- (= 1). Or align only +/+ (= 0). (Default: 0) | ||
|
||
Motif estimation: | ||
--min_seq INT Sets the minimum number of sequences required for the FASTA-files given to GLAM2. (Default: 100) | ||
--motif_min_key INT Minimum number of key positions (aligned columns) in the alignment done by GLAM2. (Default: 8) | ||
--motif_max_key INT Maximum number of key positions (aligned columns) in the alignment done by GLAM2.f (Default: 20) | ||
--iteration INT Number of iterations done by glam2. More Iterations: better results, higher runtime. (Default: 10000) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Here is glam2 all lowercase before all uppercase.
README.md
Outdated
When the environment is created, set the variable 'path_env' in the configuration file as the path to it. | ||
|
||
For unknown reasons Moods, whisch is called by tfbsscan, rarely returns empty bedfiles. If this happens The pipeline stops. | ||
The is no known fix sofar. If it happens try starting the pipeline again. If it still fails. Change the parameter tfbsscan_method to 'fimo'. This methods takes longer but will cause no error. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
*There is
*so far
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As a workaround either restart the pipeline or change the parameter tfbsscan_method
to fimo
. (Instead of the third and second to last sentences).
Can you give me the error message? then iI will add it |
The Parameter list is still not p to date. Please do not merge before it is. |
if [ $all_empty == true ] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should reformat the Parameters page to one format.
And its called enviroNment with n :D
README.md
Outdated
3. Every other dependency will be automatically installed using conda. For that a conda enviroment has to be created from the yaml-file given in this repository. | ||
It is required to create and activate the enviroment from the yaml-file beforehand. | ||
This can be done with following commands: | ||
```condsole |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
console
README.md
Outdated
|
||
**Important Note:** For conda the channel bioconda needs to be set as highest priority! This is required due to two different packages with the same name in different channels. For the pipeline the package jellyfish from the channel bioconda is needed and **NOT** the jellyfish package from the channel conda-forge! | ||
|
||
|
||
|
||
## Quick Start | ||
```console | ||
nextflow run pipeline.nf --bigwig [BigWig-file] --bed [BED-file] --genome_fasta [FASTA-file] --motif_db [MEME-file] --config [UROPA-config-file] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
organism ?!
README.md
Outdated
Optional arguments: | ||
|
||
--help [0|1] 1 to show this help message. (Default: 0) | ||
--tfbs_path Path to directory with output from tfbsscan. If given tfbsscan will not be run. | ||
--create_known_tfbs_path Path to directory where output from tfbsscan (known motifs) are stored. | ||
Path can be set as tfbs_path in next run. (Default: './') | ||
--gtf_path Path to gtf-file. If path is set the process which creats a gtf-file is skipped. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Arguments should be in one Format: With Datatype e.g. INT or without, not mixed within the document.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If its a list of strings which are allowed I prefer to write the list instead of 'STRING'. I could add 'STRING' to every path variable but I don't think that is necessary.
README.md
Outdated
--motif_max_key INT Maximum number of key positions (aligned columns) in the alignment done by GLAM2.f (Default: 20) | ||
--iteration INT Number of iterations done by glam2. More Iterations: better results, higher runtime. (Default: 10000) | ||
--motif_max_key INT Maximum number of key positions (aligned columns) in the alignment done by GLAM2. (Default: 20) | ||
--iteration INT Number of iterations done by GLAM2. More Iterations: better results, higher runtime. (Default: 10000) | ||
--tomtom_treshold float Threshold for similarity score. (Default: 0.01) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Datatypes all-caps or not -> only one format should be used
README.md
Outdated
Start with installing all dependencies listed above (Nextflow, conda, MEME-Suite) and downloading all files from the [GitHub repository](https://github.molgen.mpg.de/loosolab/masterJLU2018). | ||
It is required to set the [enviroment paths for meme-suite](http://meme-suite.org/doc/install.html?man_type=web#installingtar). | ||
1. Start with installing all dependencies listed above (Nextflow, conda, MEME-Suite) and downloading all files from the [GitHub repository](https://github.molgen.mpg.de/loosolab/masterJLU2018). | ||
2. It is required to set the [enviroment paths for meme-suite](http://meme-suite.org/doc/install.html?man_type=web#installingtar). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
enviroNment not enviroment
README.md
Outdated
this can be done with following commands: | ||
``` | ||
export PATH=[meme-suite instalation path]/libexec/meme-[meme-suite version]:$PATH | ||
export PATH=[meme-suite instalation path]/bin:$PATH | ||
``` | ||
|
||
Every other dependency will be automatically installed by Nextflow using conda. For that a new conda enviroment will be created, which can be found in the from Nextflow created work directory after the first pipeline run. | ||
It is **not** required to create and activate the enviroment from the yaml-file beforehand. | ||
3. Every other dependency will be automatically installed using conda. For that a conda enviroment has to be created from the yaml-file given in this repository. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
enviroNment
README.md
Outdated
Every other dependency will be automatically installed by Nextflow using conda. For that a new conda enviroment will be created, which can be found in the from Nextflow created work directory after the first pipeline run. | ||
It is **not** required to create and activate the enviroment from the yaml-file beforehand. | ||
3. Every other dependency will be automatically installed using conda. For that a conda enviroment has to be created from the yaml-file given in this repository. | ||
It is required to create and activate the enviroment from the yaml-file beforehand. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
enviroNment
I think we needed the better explanation for the problem with MOODS
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I added the better explanation for the issue with MOODS
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
here are the changes for the known issue with moods
…olab/masterJLU2018 into estimation_motifs
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks fine (for now :D)
No description provided.